Optimal High Dimensional Multiple Testing Under Linear Models

نویسندگان

  • Jichun Xie
  • Zhigen Zhao
چکیده

High dimensional multiple testing has many important applications. Motivated by genome-wide association studies (GWAS), we consider the problem of mulitiple testing under high dimensional sparse linear model in order to identify the genetic markers associated with the trait of interest. The model is an extension of the normal mixture model under arbitrary dependence. We propose a multiple testing procedure, which ranks and thresholds the adjusted z-surrogate. It is shown that the procedure can control mfdr level and minimize mfnr level asymptotically among all the methods based on the original data. Numerical results show that our method performs well under linear models with correlated predictors. The procedure is further illustrated through an analysis of a genome-wide association study in hypertension. At mfdr level equal to 0.05, it identifies 11 genetic markers assciated with systolic blood pressure and 11 associated with diastolic blood pressure. Many of the markers are located in the regions associated with human blood pressure based on the Rat Genome Database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

OPTIMUM GENERALIZED COMPOUND LINEAR PLAN FOR MULTIPLE-STEP STEP-STRESS ACCELERATED LIFE TESTS

In this paper, we consider an  i.e., multiple step-stress accelerated life testing (ALT) experiment with unequal duration of time . It is assumed that the time to failure of a product follows Rayleigh distribution with a log-linear relationship between stress and lifetime and also we assume a generalized Khamis-Higgins model for the effect of changing stress levels. Taking into account that the...

متن کامل

EVALUATION OF CONCRETE COMPRESSIVE STRENGTH USING ARTIFICIAL NEURAL NETWORK AND MULTIPLE LINEAR REGRESSION MODELS

In the present study, two different data-driven models, artificial neural network (ANN) and multiple linear regression (MLR) models, have been developed to predict the 28 days compressive strength of concrete. Seven different parameters namely 3/4 mm sand, 3/8 mm sand, cement content, gravel, maximums size of aggregate, fineness modulus, and water-cement ratio were considered as input variables...

متن کامل

A one-dimensional model for variations of longitudinal wave velocity under different thermal conditions

Ultrasonic testing is a versatile and important nondestructive testing method. In many industrial applications, ultrasonic testing is carried out at relatively high temperatures. Since the ultrasonic w...

متن کامل

Two-Sample Tests for High-Dimensional Linear Regression with an Application to Detecting Interactions.

Motivated by applications in genomics, we consider in this paper global and multiple testing for the comparisons of two high-dimensional linear regression models. A procedure for testing the equality of the two regression vectors globally is proposed and shown to be particularly powerful against sparse alternatives. We then introduce a multiple testing procedure for identifying unequal coordina...

متن کامل

THE CAPABILITY OF OPTIMAL SINGLE AND MULTIPLE TUNED MASS DAMPERS UNDER MULTIPLE EARTHQUAKES

The main focus of this research has been to investigate the effectiveness of optimal single and multiple Tuned Mass Dampers (TMDs) under different ground motions as well as to develop a procedure for designing TMD and MTMDs to be effective under multiple records. To determine the parameters of TMD and MTMDs under multiple records various scenarios have been suggested and their efficiency has be...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013